Pii: S0025-5564(01)00096-7
نویسندگان
چکیده
We present a theory of classification and predictive identification of bacteria. Bacterial strains are characterized by a binary vector and the taxonomy is specified by attaching a label to each vector. The theory is developed from only two basic assumptions, viz. that the sequence of pairs of feature vectors and the attached labels is judged (infinitely) exchangeable and predictively sufficient. We derive expressions for the training error and the probability of identification error and show that latter is an affine function of the former. We prove the law of large numbers for identification matrices, which contain the fundamental information of bacterial data. We prove the Bayesian risk consistency of the predictive identification rule given by the theory and show that the training error is a consistent estimate of the generalization error. 2002 Published by Elsevier Science Inc. MSC: 62H30; 92B10; 94A17
منابع مشابه
Pii: S0025-5564(02)00128-1
Mathematical models can help predict the effectiveness of control measures on the spread of HIV and other sexually transmitted diseases (STDs) by reducing the uncertainty in assessing the impact of intervention strategies such as random screening and contact tracing. Even though contact tracing is one of the most effective methods used for controlling treatable STDs, it is still a controversial...
متن کاملPii: S0025-5564(00)00033-x
In this paper, we establish a mathematical model of two species with stage structure and the relation of predator±prey, to obtain the necessary and sucient condition for the permanence of two species and the extinction of one species or two species. We also obtain the optimal harvesting policy and the threshold of the harvesting for sustainable development. Ó 2000 Elsevier Science Inc. All rig...
متن کاملPii: S0025-5564(99)00060-7
Null models for generating binary phylogenetic trees are useful for testing evolutionary hypotheses and reconstructing phylogenies. We consider two such null models ± the Yule and uniform models ± and in particular the induced distribution they generate on the number Cn of cherries in the tree, where a cherry is a pair of leaves each of which is adjacent to a common ancestor. By realizing the p...
متن کاملPii: S0025-5564(00)00005-5
We consider a single-species model which is composed of several patches connected by linear migration rates and having logistic growth with a threshold. We show the existence of an aggregating mechanism that allows the survival of a species which is in danger of extinction due to its low population density. Numerical experiments illustrate these results. Ó 2000 Elsevier Science Inc. All rights ...
متن کاملPii: S0025-5564(01)00107-9
We outline and describe steps for a statistically rigorous approach to analyzing probe-level Affymetrix GeneChip data. The approach employs classical linear mixed models and operates on a gene-by-gene basis. Forgoing any attempts at gene presence or absence calls, the method simultaneously considers the data across all chips in an experiment. Primary output includes precise estimates of fold ch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002